Voice controlled multimedia and communications device

ABSTRACT

A portable multimedia and communications device can include a transducive element for receiving sound. The device also can include a base unit having a plurality of multimedia units and a processor executing a speech recognition engine for recognizing user speech. Each of the plurality of multimedia units can be selectively enabled and operated responsive to user voice commands received via the transducive element and communicated to the base unit via a communication link.

BACKGROUND OF THE INVENTION

1. Technical Field

The present invention relates the field of portable communicationsdevices.

2. Description of the Related Art

As consumers continue to demand mobile access to many computing andcommunications services, technology continues to evolve to accommodateconsumer demand. Ideally, consumers prefer to have access to most, ifnot all, of the same computing functions presently available at home orthe office despite being away from such resources. In response toconsumer demand, a wide variety of portable electronic devices havebecome commercially available. For example, individuals can track andstore a variety of personal information using conventional personaldigital assistants, communicate with friends, family, and businesscontacts using cellular telephones, and listen to music through portableradios, compact disc (CD) players, and/or Motion Picture. ExpertsGroup-1 Audio Layer 3 (MP3) players. A variety of other portable devicessuch as portable short distance radios and global positioning system(GPS) receivers are available to consumers as well.

The majority of commercially available portable devices are sold assingle function units. That is, the consumer typically must purchase amusic player separately from a cellular telephone, or a citizen bandradio separately from a GPS receiver. Typically, consumers must carry aplethora of electronic devices should the consumer wish to travel withcalendaring, music, communications, and the like. Even in cases whereone or more of these devices are available in various combinationswithin a single unit, frequently, no single combination device is ableto meet the consumer's full range of needs.

Although each device may not be burdensome to operate alone, when usedin combination with one or more other devices, the consumer can beoverwhelmed with connections, wires, cables, and attachments. To betterorganize the various electronic devices a consumer may carry, severalvarieties of carry-alls or electronic device organizers are available.Still, as the benefit of using a portable electronic device is theimmediacy and ease with which the different functions of the device canbe accessed, storing the device in a carry-all may not prove to beconvenient. For example, by the time a consumer is able to locate aringing cellular telephone within a carry-all, the call may be missed.By the time one finds a personal digital assistant within a carry-all,the opportunity to use the sought after information may have passed.

Assuming that a consumer is able to carry each of the various portableelectronic devices that the consumer needs or desires, the consumerstill must have one or more hands free in order to operate any givendevice. With respect to travelers, however, this is not often the case.More than likely, travelers use one or both hands to carry luggage, acarry-on bag, or possibly a personal digital assistant. To operate anyof the traveler's portable electronic devices, the traveler may have tostop, put down one or more bags, search for the device, and only thenproceed to use the desired device.

Other users, for example individuals who enjoy walking, jogging, hiking,bicycling, motorcycling, boating, or other activities, also rely uponportable electronic devices such as GPS units, music players, cellulartelephones, voice memo pads, and the like when engaged in sports orother outdoor activities. Such individuals, however, not only must haveone or more free hands to operate a given device, but also must be ableto divert their attention away from an ongoing activity to operate thedevice. For example, the user typically must watch a dial or display asa radio is tuned, as a volume is adjusted, and the like. The use of oneseyes and hands, however, can cause a break in rhythm and concentrationand may even cause an accident.

SUMMARY OF THE INVENTION

The present invention provides a solution for persons requiring accessto a variety of multimedia, communications, and computing functions froma single electronic device. In particular, the present inventionprovides a portable electronic device which can include one or more ofan AM/FM radio, a music player, a short distance radio, a voice memopad, a cellular telephone, a global positioning system (GPS) receiver,an AM/FM radio interface, and a transponder (hereafter collectivelyreferred to as “multimedia units”). Importantly, each of the variousmultimedia units not only can be included within a single, portabledevice, but also can be operated in a hands-free manner through the useof voice commands and speech recognition technology. Although the devicecan be embodied in a variety of forms, according to one inventivearrangement, the device can be modular, wearable, and weather resistant.

One aspect of the present invention can include a portable multimediaand communications device. The device can include a transducive elementfor receiving sound. According to one arrangement, the transduciveelement can be disposed in a first headset unit which can include asound generation source and a short range wireless transceiver. Thedevice also can include a base unit having a plurality of multimediaunits and a processor executing a speech recognition engine forrecognizing user speech. Notably, the speech recognition engine can beswitched between a speaker-independent operational mode and aspeaker-dependent operational mode responsive to a control signal, forexample a user spoken utterance.

The device can be voice enabled such that each of the multimedia unitscan be selectively enabled and operated responsive to user voicecommands received via the transducive element and communicated to thebase unit via a communication link. Notably, the base unit can include asecond short range wireless transceiver for communicating with the shortrange wireless transceiver of the headset unit. Accordingly, thecommunication link can be a wireless communication link. Still, thetransducive element, or the headset unit depending upon the particularembodiment of the present invention, can be connected to the base unitvia a wired connection.

According to one aspect of the present invention, responsive toactivating one of the plurality of multimedia units, the processor candisable another active one of the plurality of multimedia units bylowering an audio volume of the other active multimedia unit or pausingoperation of the other active multimedia unit. Responsive to terminatinguse of the activated multimedia unit, the processor can re-enable theother active one of the plurality of multimedia units.

For example, one of the multimedia units can be a wireless telephone.Thus, responsive to the wireless telephone receiving a telephone call,the processor can disable an active one of the multimedia units. Forinstance, the processor can disable an active multimedia unit bylowering an audio volume of the active multimedia unit or by pausingoperation of the active multimedia unit. Responsive to termination ofthe telephone call, the processor can re-enable the active multimediaunit.

Taking another example, a second one of the multimedia units can be ashort distance radio. In that case, responsive to the short distanceradio receiving a wireless communication, the processor can disable anactive one of the multimedia units. The processor can disable an activemultimedia unit by lowering an audio volume of the active multimediaunit or by pausing operation of the active multimedia unit. Responsiveto termination of the wireless communication, the processor canre-enable the active multimedia unit.

Still, the present invention can include additional multimedia units.For example, the present invention can include, but is not limited to,multimedia units such as a GPS receiver, an AM/FM radio tuner, an audioplayer, a compact disc player, a digital software-based media player, acassette player, a voice recorder memo pad, an AM/FM radio interface,and a transponder.

Another embodiment of the present invention can include at least asecond headset unit configured similarly to the first headset unit. Thefirst headset unit and the second headset unit can communicate with oneanother through the base unit via wireless communication links.

Another aspect of the present invention can include a method ofoperating a voice activated, portable multimedia and communicationsdevice. The method can include receiving a user spoken utterance in atransducive element. The transducive element can be included within aheadset unit. The user spoken utterance can be transmitted to a baseunit via a wired connection or a short distance wireless communicationlink. The user spoken utterance can be speech recognized as a validvoice command. Accordingly, one of the multimedia units disposed in thebase unit can be selectively enabled responsive to identifying the userspoken utterance as a valid voice command. Notably, the method caninclude switching between a speaker-independent operational mode and aspeaker-dependent operational mode responsive to a control signal suchas a voice command.

The method can include activating one of the plurality of multimediaunits, and responsive to the activating step, disabling another activeone of the plurality of multimedia units. For example, the disablingstep can include lowering an audio volume of the other active multimediaunit or pausing operation of the other active multimedia unit.Responsive to terminating use of the activated one of the plurality ofmultimedia units, the other active one of the plurality of multimediaunits can be re-enabled.

According to one embodiment of the present invention, the voice commandcan activate an intercom operational mode. In that case, the method caninclude receiving a subsequent user spoken utterance in the base unitfrom the headset unit via a wireless communication link. The user spokenutterance can be forwarded to a second headset unit via a secondwireless communication link.

Still, one of the multimedia units can be a wireless telephone.Accordingly, the method can include receiving a telephone call in thewireless telephone and playing an audio notification through the headsetunit indicating that the call has been received. For example, the audionotification can play a name associated with the received telephone callor a calling telephone number of the received call. Notably, the audionotification can query the user whether to answer the received telephonecall. If another one of the multimedia units is active when thetelephone call is received, the method further can include, responsiveto receiving the telephone call, disabling the active multimedia unit.The active multimedia unit can be disabled by lowering an audio volumeof the active multimedia unit or by pausing operation of the activemultimedia unit. Responsive to terminating the telephone call, theactive multimedia unit can be re-enabled. The method also can includedetermining that a calling number for the received telephone call is notincluded in a contact list and querying a user whether to save thereceived call information.

According to another embodiment of the present invention, one of themultimedia units can be a short distance radio. In that case the methodcan include receiving a short distance radio communication in the shortdistance radio and providing an audio notification to the headset unitindicating that the short distance radio communication has beenreceived. For example, the user can be queried as to whether thereceived short distance radio communication should be answered. Ifanother one of the multimedia units is active when the short distanceradio communication is received, the method also can include, responsiveto receiving the short distance radio communication, disabling theactive multimedia unit. For example, the active multimedia unit can bedisabled by lowering an audio volume of the active multimedia unit or bypausing operation of the active multimedia unit. Responsive toterminating the short distance radio communication, the activemultimedia unit can be re-enabled. Notably, if one of the plurality ofmultimedia units is a wireless telephone and another one of theplurality of multimedia units is a short distance radio, the method caninclude exchanging audio between the wireless telephone and the shortdistance radio.

BRIEF DESCRIPTION OF THE DRAWINGS

There are shown in the drawings embodiments which are presentlypreferred, it being understood, however, that the invention is notlimited to the precise arrangements and instrumentalities shown.

FIG. 1 is a schematic diagram illustrating a voice-controlled multimediaand communications device in accordance with the inventive arrangementsdisclosed herein.

FIG. 2 is a flow chart illustrating a method of communicating betweenthe headset unit and the base unit of the voice-controlled multimediaand communications device of FIG. 1.

FIG. 3 is a flow chart illustrating a method of receiving and processingwireless communications in accordance with the present invention.

DESCRIPTION OF THE INVENTION

The present invention provides a solution for persons requiringhands-free control over a variety of multimedia, communications, andcomputing functions contained within a single electronic device. Inparticular, the present invention provides a portable electronic devicewhich can include one or more of an AM/FM radio, a music player, a shortdistance radio, a voice memo pad, a cellular telephone, a globalpositioning system (GPS) receiver, an AM/FM radio interface, and atransponder (hereafter collectively referred to as “multimedia units”).Importantly, each of the various multimedia units can be operated in ahands-free manner through the use of voice commands. Although the devicecan be embodied in a variety of forms, according to one inventivearrangement, the device can be modular, wearable, and weather resistant.

FIG. 1 is a schematic diagram illustrating a voice-controlled multimediaand communications device (VCMCD) 100 in accordance with the inventivearrangements disclosed herein. As shown, the VCMCD 100 can include twoprimary components, a headset unit 105 and a base unit 110. The headsetunit 105 and the base unit 110 can communicate with one another via awireless communication link 115. The headset unit 105 can include aspeaker 120 or other sound generator, a transducive element 125, aprocessor 130, a wireless transceiver 135, and an antenna 140. Thespeaker 120 can be disposed in an earpiece portion of the headset unit105 to provide audio output when the headset 105 is worn by a user.Although not shown in FIG. 1, the headset unit 105 can includeadditional speakers so as to provide stereo sound. Accordingly, theheadset can include two earpiece portions, each having a speakerdisposed therein.

The transducive element 125, for example a microphone, can be attachedto the headset unit 105. For example, according to one embodiment of thepresent invention, the transducive element 125 can be located on an armwhich is rotatably attached to the earpiece so as to swing up and awayfrom a user's face when not in use. Still, the transducive element 125can be attached to the headset in any of a variety of different forms.For instance, the transducive element can be disposed within a clip-onapparatus which can clip onto a user's clothing. In that case, thetransducive element 125 can be communicatively linked to the headsetunit 105 via a wired connection. In the event the transducive element125 is located as a separate component from the headset unit 105, thetransducive element 125 can be communicatively linked to the base unit110 via a wireless or wired connection.

Continuing with FIG. 1, the speaker 120 and the transducive element 125can be communicatively linked to the processor 130. The processor 130can perform a variety of audio processing and routing functions, as wellas serve as an interface between the speaker 120, the transduciveelement 125, and the wireless transceiver 135. In particular, theprocessor 130 can perform analog-to-digital (A/D) conversions of audiosignals received from the transducive element 125 and performdigital-to-analog (D/A) conversions of audio so as to provide a suitableaudio signal to the speaker 120.

As mentioned, the processor 130 further can route signals among thevarious components of the headset unit 105. In particular, the processor130 can receive audio signals, control signals, and other dataoriginating from the base unit 110 through the wireless transceiver 135.The processor 130 can separate audio signals from the various controlsignals, D/A convert the audio, and provide the audio to the speaker120. Similarly, the processor 130 can A/D convert audio signals receivedfrom the transducive element 125, and provide the converted audio inaddition to any control signals and data to the wireless transceiver 135for transmission to the base unit 110.

It should be appreciated that the processor 130, although depicted as asingle general purpose processor, can be embodied as one or moreprocessors, including control processors and audio processors such asdigital signal processing (DSP) units. The speaker 120 and thetransducive element 125 can be communicatively linked to the processor130 via suitable circuitry. The processor 130 and the wirelesstransceiver 135 also can be communicatively linked via circuitry, forexample a communications bus capable of supporting two waycommunications.

The wireless transceiver 135 can serve as an interface to the base unit110. Specifically, the wireless transceiver 135 can receive signals fromthe processor 130 and convert the signals for use with a suitablewireless communications protocol for transmission. Similarly, thewireless transceiver 135 can receive signals from the base unit 110 anddecode the signals using an appropriate wireless communications protocolprior to forwarding the signals to the processor 130.

The wireless transceiver 135 can be embodied in a variety of forms. Forexample, the wireless transceiver 135 can be configured to communicatewith the base unit 110 using sound waves or infrared light. According toone embodiment of the invention, however, radio waves, that is radiofrequency (RF) communication can be used. Thus, the wireless transceiver135 can provide a signal to antenna 140 for transmission to the baseunit 110. The wireless transceiver 135 can be configured to communicatewith the base unit 110 using any of a variety of short range wirelesstechnologies and/or protocols. For example, the wireless transceiver 135can utilize BLUETOOTH technology, another technology of the 802.11family of wireless communication specifications, as well as otherproprietary technologies.

It should be appreciated, however, that the audio signal received fromthe transducive element 125 can be provided to the wireless transceiver135 and transmitted in analog form. Accordingly, after the audio signalis received by the base unit 110, the processor 150, to be described ingreater detail, can A/D convert the signal. In such an embodiment, audioand control signals transmitted from the base unit 110 to the headsetunit 105 can be transmitted in analog form as well. In that case, theheadset 105 can include additional analog processing circuitry,including analog audio processing circuitry.

According to another embodiment of the present invention, the speaker120, the transducive element 125, and the processor 130 can becommunicatively linked to the base unit 110 (to be described herein ingreater detail) via a suitable wired connection which can facilitate twoway communications among the headset unit 105 and the base unit 110.

On the exterior of the headset unit 105, one or more control buttonsand/or switches can be included. The controls can be hardwired toperform one or more predetermined functions, or can be programmedthrough the base unit 110 to perform one or more of a variety ofdifferent functions. Alternatively, a software application executing onanother computer system can be used to configure the VCMCD 100 using acharging cradle or other data link which communicatively links theheadset unit 105, the base unit 110, and the separate computer system.Regardless, the controls can be assigned to perform functions such asinterrupting the processor 130 so as to give precedence to audio inputreceived from the transducive element 125, dialing 911, sending anemergency signal over the short distance radio, sending a transpondersignal, or awakening the VCMCD 100 from a power save or sleep mode. Forexample, activating a control can notify the VCMCD 100 that speech tofollow specifies a name for voice dialing or is to be recorded as avoice memo. Still, one or more controls, whether permanently configuredor software configurable, can be included for accessing any of thevarious functions to be described herein with reference to the base unit110. For example, one or more control buttons can be dynamicallyassigned a function according to the operating state of the VCMCD 100.

The base unit 110 of the VCMCD 100 can include a wireless transceiver145, a processor 150, a compact disc (CD) player 155 or other musicsource, a short distance radio 160, a wireless telephone 165, an AM/FMradio tuner 170, a GPS receiver 175, an AM/FM radio interface, as wellas a transponder 199. The wireless transceiver 145 can be matched to thewireless transceiver 135 of the headset unit 105 so as to supporttwo-way wireless communications via antenna 180 as previously discussed.The transceiver 145 can be communicatively linked to the processor 150via suitable interface circuitry. Thus, the transceiver 140 can receivedata from the processor 150, format data using the appropriatecommunications protocol, and send data to the headset unit 105 via thewireless communication link 115. Similarly, data received from theheadset unit 105 over the wireless communication link 115 can be decodedand provided to the processor 150.

The processor 150 can be communicatively linked to the CD player 155,the short distance radio 160, the wireless telephone 165, the AM/FMradio tuner 170, the GPS receiver 175, the AM/FM radio interface 197,and the transponder 199 via suitable circuitry such as a communicationsbus 195. The communications bus 195 can support two way communicationamong each of the respective components, thereby enabling the exchangeof control signals as well audio data.

The processor 150, similar to the processor 130 of the headset unit 105,can include one or more general purpose processors and/or DSP units. Theprocessor 150, however, can include a speech recognition engine 185. Forexample, the speech recognition engine 185 can be included in the memoryof processor 150 or can be included within an additional memory 190which is accessible by the processor 150. In any case, the speechrecognition engine 185 can receive digitized speech from the headsetunit 105 and convert the received speech to text. The speech recognizedtext can be processed to determine one or more voice commands which canbe executed by the processor 150.

Still, as discussed, the based unit 110 can receive analog signals fromthe headset unit 105 which can be decoded by the wireless transceiver145. In that case, the analog signals can be A/D converted by theprocessor 150 for further processing such as speech recognition.

The CD player 155 can be communicatively linked to the processor 150 viathe communications bus 195. Notably, additional audio sources such astape players and other digital medial players including, but not limitedto, minidisk players, Motion Picture Experts Group-1 Audio Layer 3 (MP3)players, and the like can be included in place of, or in addition to,the CD player 155. According to one embodiment of the present invention,one or more software-based audio players can be included within the baseunit 110.

The short distance radio 160 can be any of a variety of personal two-wayshort distance radio communications devices which can operate overdistances of less than one mile and up to distances of tens of miles.For example, the short distance radio 160 can be configured to operateover one of the Citizen's Band Radio Services, such as Citizen's Band(CB) or Family Radio Service (FRS). Alternatively, the short distanceradio 160 can be configured to operate over General Mobile Radio Service(GMRS), or some other short distance radio communications format.

The wireless telephone 165, for example a cellular telephone, the AM/FMradio tuner 170, and the GPS receiver 175 are commercially availablecomponents. Notably, the AM/FM radio tuner 170 can be programmed withone or more preset stations for both AM and FM bands. Each of thevarious preset stations can be accessed via a suitable voice commandsuch as “preset 1”, “preset 2”, “next preset up”, or “next preset down”.Each of the respective components can be communicatively linked to theprocessor 150 via the communications bus 195. The transponder 199 canprovide wireless communications for monitoring incoming signals orsending outgoing signals such as a “beacon” signal which can be trackedby third parties for purposes of safety. Although the transponder 199can include active components so that the transponder can receive andsend signals which can be detected over long ranges, the transponder 199also can include passive components enabling the transponder 199 to bedetected by sensors within ranges of several feet. Notably, thetransponder 199 can be configured to transmit coordinates as determinedby the GPS receiver 175 in cases of emergency or periodically dependingupon the VCMCD 100 configuration.

The AM/FM radio interface 197 allows the base unit 110 to be connectedto a car radio, for example, via an antenna input or other standardconnector available on conventional car radios. Responsive to connectingthe base unit 110 to a car radio via the AM/FM radio interface 197, theprocessor 150 can direct audio output through the communications bus 195to the AM/FM radio interface 197. The AM/FM radio interface 198 enablesa user to listen to audio received and generated by the VCMCD 100through a high quality car audio system rather than the headset unit105.

According to another embodiment of the present invention, the headsetunit 105 can be configured, or a separate and additional transduciveelement can be configured, without the speaker 120. As the audiooriginating from the base unit 110 can be played through the car audiosystem when the AM/FM radio interface 197 is engaged, no speaker need beincluded in the headset unit 105. For example, a wearable microphonesuch as a lapel microphone can be used. Such a microphone can bewirelessly connected to the base unit 110 as described. Alternatively,the microphone can be connected to the base unit 110 via a wiredconnection to a suitable audio jack disposed in the base unit 110 whichcan be connected to the processor 150 via suitable interface circuitry.

The processor 150 can receive speech recognized text from the speechrecognition engine 185 to compare against a grammar or vocabulary ofrecognizable words and/or voice commands. If a match exists, theprocessor 150 can determine an appropriate action. In particular,responsive to voice commands, the processor 150 can selectively enableand disable the CD player 155 or other audio source, the short distanceradio 160, the wireless telephone 165, the AM/FM radio tuner 170, andthe GPS receiver 175. Moreover, the processor 150 can control softwaredriven processes such as recording voice memos and the like responsiveto voice commands. Notably, not only can a user select a particulardevice through voice commands, but the user also can access the variousoperational parameters of each of the respective devices via voicecommands. An exemplary listing of voice commands is included in theAppendix.

As shown in FIG. 1, the short distance radio 160, the wireless telephone165, the AM/FM radio tuner 170, the GPS receiver 175, and thetransponder 199 each can be configured to include a separate antenna forsending and receiving wireless signals. It should be appreciated bythose skilled in the art, however, that one or more of these componentscan be configured to share a common antenna. For example, the wirelesstelephone 165 can be configured to share a general purpose antenna withthe short distance radio 160, or with the wireless transceiver 145.Regardless of the particular antenna configuration used, the examplesdisclosed herein are not intended as a limitation of the presentinvention.

A power source (not shown) can be included in the headset unit 105 andthe base unit 110. The power source can be one or more disposablebatteries or rechargeable batteries. For example, a rechargeable powersource can be recharged in a charging cradle similar to the cradlecommonly used with conventional personal digital assistants. Notably,both the headset 105 and the base unit 110 can be configured to chargewithin such as cradle. Still, other charging mechanisms can be used tocharge the power source of the VCMCD 100 such as wearable, flexiblephotovoltaic devices, regenerative braking charging units, or chargerswhich connect to an electrical connection such as a cigarette lighterreceptacle commonly used in motor vehicles. Accordingly, the base unit110 can draw power to operate as well as recharge a rechargeable powersource via such an electrical connection.

FIG. 2 is a flow chart illustrating a method 200 of communicatingbetween the headset unit and the base unit of FIG. 1. In step 205, theVCMCD can be powered on. The VCMCD can include a power switch on theheadset unit as well as the base unit. Accordingly, the headset unit andthe base unit can be powered on independently of one another. In analternative embodiment of the invention, the VCMCD can be powered on bya voice command. For example, the unit can remain in a low powerconsumption sleep mode such that a received voice command such as “poweron” can activate or awaken the VCMCD.

In any case, once the VCMCD is powered on, the VCMCD can be configuredto begin operation in either a speaker independent mode or a speakerdependent mode. In speaker independent mode, a generalized speaker modelcan be used for purposes of speech recognition. That is, the speakermodel is not tailored for use with any particular user. In speakerdependent mode, a speaker-specific voice model can be used whichemphasizes perceptually important speaker characteristics. Although theVCMCD can determine a speaker dependent model, the speaker dependentmodel also can be derived from a counterpart application and transferredto the base unit via a data link. The VCMCD can be configured to poweron in either speaker independent or speaker dependent mode throughvarious controls on the base unit, or using a software applicationconfigured to communicate with the VCMCD via a charging cradle, auniversal serial bus port, or other wired or wireless connection.

In step 210, the VCMCD can enter a power save mode awaiting user speech.If user speech is received, the method can continue to step 215. If not,however, the method can continue to loop until user speech is received.If user speech is received, the method can continue to step 220.According to one embodiment of the invention, the processor canimplement one or more noise filters and frequency filters so as toemphasize sounds within the frequency range of human speech. Thus, if asignal is received having a particular strength and within a particularfrequency range, the signal can be considered a speech signal. Notably,the transducive element can be suited for use with the frequency rangeof human speech as well as be equipped with a wind guard. Accordingly,user speech can be detected automatically and distinguished frombackground and environmental noise.

In an alternative embodiment of the invention, either the headset unit,the base unit, or both can be equipped with a control which awakens theVCMCD or serves as an indication that the user will speak. The user canactivate such a control prior to speaking. In any event, user speech canbe received through the transducive element and converted to an analogaudio signal to be provided to the headset processor.

Continuing with step 220, the analog speech signal can be processed fortransmission to the base unit. In particular, the analog speech signalcan be A/D converted by the headset processor and provided to theheadset transceiver. The headset transceiver 220 can prepare thedigitized speech signal according to a wireless communication protocolfor transmission to the base unit. Accordingly, in step 225, the headsetunit can transmit the speech signal to the base unit through thewireless communication link.

In step 230, the base unit can receive the audio data from the headsetvia the wireless communication link and the base unit wirelesstransceiver. The base unit wireless transceiver can decode the signaland provide the digitized audio signal to the base unit processor. Instep 235, the processor can perform speech recognition on the receivedaudio and determine whether the received audio is a valid voice command.In particular, the speech recognition engine can compare the recognizedtext to one or more possible commands specified in a command grammar orlist. If the speech is not a valid command, the method can continue tostep 210 and repeat as necessary. If the speech is a valid command, themethod can continue to step 240 to determine the operating statecorresponding to the received voice command.

In particular, the user can select any of a variety of operating modesincluding, but not limited to a voice memo mode, a CD player mode, ashort distance radio mode, a wireless telephone mode, an AM/FM tunermode, and a GPS mode. Each of the various operating modes can beassociated with a predefined command vocabulary. Exemplary voicecommands for each of the modes are illustrated in the Appendix. Forexample, the user can utter “radio on”, “CD on”, “communicator on”,“memo on”, or “locator on”, and once received, the VCMCD will enter theappropriate operating state. Accordingly, if the command “CD on” isreceived, the command grammar associated with operation of the CD playercan be activated. The command grammar for the CD player can includecommands for skipping tracks, fast forwarding, rewinding, playing,pausing, stopping, as well as voice commands for various modes ofcontinuous and random play.

It should be appreciated, however, that particular commands, for examplevolume commands, operating state commands, and power commands, canremain active no matter which state the VCMCD is operating. This enablesa user to change operating states during operation of any of themultimedia units disclosed herein. For example, while dictating a voicememo, the user can say “phone”, which can be a reserved voice commandwhich initiates the wireless telephone mode. Accordingly, once the VCMCDenters the wireless telephone mode, the user can access voice dialing orother telephone related functions using voice commands.

Other voice commands which can remain in the active vocabulary caninclude specialized communications functions. For example, emergency or“911” access can be accessible no matter which mode the VCMCD isoperating. Thus, the user can say “emergency 911” which can cause theVCMCD to initiate a cellular telephone call to 911 or anotherpredetermined number, for example one corresponding to an emergencyservice. Voice dialing can remain active at all times such that the usercan say “voice dial” and a contact name no matter which mode the VCMCDis operating. Notably, the VCMCD can be configured to transmitcoordinates with an emergency call. Similar functionality can beavailable using the short distance radio and transponder. For example,responsive to a voice command such as “emergency communication”, theVCMCD can transmit a call for help over one or more specified channelsor continuously over one or more specified channels using the shortdistance radio. The user's coordinates as determined by the GPS receiveralso can be transmitted through the short distance radio. Notably, theuser's location can be specified either by sending data or by playing atext-to-speech rendition of the GPS receiver determined coordinates.

FIG. 3 is a flow chart illustrating a method 300 of receiving wirelesscommunications in accordance with the present invention. The method canbegin in a state wherein the user is engaged in an activity such aslistening to a CD or audio file played by the audio source in the baseunit or is dictating a voice memo. Accordingly, in step 305, a wirelesscommunication can be received. For example, a telephone call can bedetected by the wireless telephone in the base unit or an incomingcommunication can be received by the short distance radio in the baseunit. Accordingly, in step 310, the particular multimedia unit havingreceived the incoming wireless communication can send a notification tothe base unit processor.

In step 315, the processor can instruct the current function to pause orotherwise stop. For example, the processor can pause the recording of avoice memo, stop the audio feed from the CD player or AM/FM radio tuner,or lower the volume of the multimedia unit currently playing. Similarly,if a telephone call is received during a short distance radiocommunication, or a short distance radio communication is receivedduring an ongoing telephone call, the audio of the current activefunction can be lowered so that an audio prompt provided by the VCMCDcan be heard by the user.

In step 320, the base unit processor can cause an audio notification tobe played to the user. For example, a tone can be played from a tonegenerator (not shown) to indicate that an incoming communication hasbeen received. Distinctive tones can be used to signify whether thereceived communication is a short distance radio communication or areceived wireless telephone call. Alternatively, the processor cansimply play an audio prompt which explicitly states that a telephonecall or a short distance radio communication has been received. Forexample, a recorded voice prompt can be played or text-to-speech (TTS)technology can be used to play prompts to the user. Still, the base unitprocessor can send a control signal to the processor of the headset toplay an audio prompt stored in the headset unit.

According to one embodiment of the invention, the audio prompt caninform the user of the identity of the calling party. For example, thecalling number can be played to the user. Additionally, if the callingnumber, or source identifier in the case of a short distance radiocommunication, is specified within a contact list stored within theVCMCD, then the contact name can be played. Notably, the contact namecan be played using TTS technology or, as the VCMCD can perform voicedialing, the recorded user sample of the contact's name can be played toindicate the identity of the calling party or source of the incomingwireless communication.

In step 325, a determination can be made as to whether the user wishesto respond to the received wireless communication. In particular, theuser can say “yes” or “no” to indicate whether the user wishes torespond to the incoming wireless communication. Notably, as mentioned,one or more “hot keys”, or keys the function of which changes accordingto the operational state of the VCMCD can be provided. For example,controls can be provided allowing the user to answer the incomingwireless communication or send the wireless communication directly tovoice mail. Still, another control can be used to ignore the incomingwireless communication and return directly to the previous functionwithout having to wait a predetermined amount of time and hear severalaudio prompts before the wireless communication is sent to voice mail.Accordingly, a user can select such a key to respond to a wirelesscommunication or simply ignore any audio prompts allowing thepredetermined response time to expire such that the incoming wirelesscommunication is ignored. Thus, if the user indicates that the incomingwireless communication is to be answered, the method can continue tostep 330. If, however, the user indicates that the incoming wirelesscommunication is to be ignored, the method can proceed to step 335.Notably, in the case where the wireless communication is not to beanswered, the VCMCD can notify the user whether a message has been left.In any case, it should be appreciated that any function which can becontrolled via a hardware control can be accessed also by a voicecommand.

Continuing with step 330, the VCMCD can switch operational modes fromcurrent mode of the VCMCD to either the short distance radio mode or thewireless telephone mode depending upon the particular wirelesscommunication received in step 305. Accordingly, in step 340, the VCMCDcan perform the functions necessary for the user to engage in shortdistance radio communications or a telephone call. Thus, audio receivedform the wireless communication can be provided to the headset unit andspeech received from the user can be transmitted to the caller orindividual sending short distance radio communication. In step 345,after the user has finished participating in the telephone call or theshort distance radio communications, the communication session can beterminated. For example, the user can say “end call”, “endcommunication”, or can activate a control on the headset or the baseunit. Notably, in the case of a wireless telephone call, the wirelesstelephone can automatically detect the termination of a call by theother participant. According to another embodiment of the presentinvention, the VCMCD can terminate an ongoing wireless communicationafter a predetermined time period has elapsed without any audio beingreceived from the participant or generated by the user.

In step 350, responsive to termination of the communication session, theVCMCD can switch operational modes back to the particular mode in whichthe VCMCD was engaged prior to receiving the wireless communication.Accordingly, the user can resume any paused and/or discontinuedactivity. It should be appreciated that the method 300 illustrated inFIG. 3 is for purposes of example only. Thus, if a wirelesscommunication is received when the VCMCD is not engaged in anotherfunction or the unit is in a sleep mode, the VCMCD still can query theuser as to whether the received wireless communication is to beanswered.

The VCMCD, as mentioned, can include a contact list and/or database.According to one embodiment of the present invention, the VCMCD candetermine that a calling telephone number of a received call does notexist in the user's contact list. Thus, the VCMCD can prompt the userwhether to store the calling number. If the user wishes to store thenumber, the VCMCD can query the user for the name of the contactcorresponding to the calling number. The name can be specified bydictating the spelling of the name letter by letter so that the letterscan be speech recognized or by dictating the entire name such that thename can be recorded or speech recognized and stored for use in voicedialing the number in the future. Notably, the recorded speech orrecognized text as provided to a TTS system can be used in audio promptsfor notifying the user of an incoming call from the newly stored name.

The present invention can provide a variety of different functions, eachof which can be accessed in a hands-free manner through one or morevoice commands, or through one or more controls and operationalstate-dependent function controls. Accordingly, although many of thefunctions of the VCMCD have been described in detail, the VCMCD canprovide a variety of additional functions. For example, according to oneembodiment of the present invention, the VCMCD can function as anintercom. In that case, the VCMCD can be configured to operate with twoor more headset units which can communicate with one another via asingle base unit. Thus, the present invention can be used bymotorcyclists or other active persons engaged in activities wherein theenvironment complicates communication between participants. Notably, theheadset units can be incorporated into helmets or other varieties ofprotective head gear.

The intercom function of the VCMCD further can make the variousfunctions of the VCMCD available to all users of a communicativelylinked headset. For example, the VCMCD can be placed in speakerindependent mode to receive voice commands from either user or can beplaced in speaker dependent mode to receive voice commands from oneuser. Regardless, audio from the CD player and/or audio source, theAM/FM radio tuner, the short distance radio, and the wireless telephonecan be made available to one or more headsets selectively. Notably,using the headsets, one or more users having access to a headset canparticipate in a given telephone call.

Another aspect of the present invention can include the ability tocommunicatively link the wireless telephone and the short distance radiosuch that audio received via the wireless telephone can be routed to theshort distance radio and visa versa. This enables the free exchange ofcommunications between short distance radio participants, one or moreusers of the VCMCD having headsets, and wireless telephone participants.

Notably, as the VCMCD includes a wireless telephone, standard telephonyfunctions such as conference calling also can be made available.Conference call functions can be accessed via one or more voicecommands. For example, responsive to a voice command such as “conferencecaller 1”, the VCMCD can place an existing call on hold, initiate a callto a contact “caller 1”, and conference caller 1 into the existing call.It should be appreciated that the particular method of establishing theconference call can vary, and that the number of conference callers tobe joined to an existing call need only be limited by the telephonyservice. For example, responsive to a voice command such as “conferencecaller 1 and caller 2”, the VCMCD can initiate a call to caller 1,establish the call with caller 1, place caller 1 on hold, and initiate acall to caller 2 to eventually conference caller 2 into the existingcall with caller 1.

Another aspect of the present invention can provide directional servicesto users. In particular, as the GPS receiver can continually determinecoordinates specifying the location of a user, those coordinates can beprovided to the base unit processor. According to one embodiment of thepresent invention, the processor can provide the coordinates to aservice which can be accessed via the wireless telephone. Directions toa user specified location then can be provided back to the base unit andplayed to the user. Alternatively, the base unit can store one or moredifferent maps in memory allowing the base unit processor to determine aroute to the user specified location and provide audible instructions asto how to follow the route.

Those skilled in the art will recognize that the invention disclosedherein can be embodied in a variety of different physical arrangements.Accordingly, depending upon the size of the various multimedia units tobe included within the present invention, the base unit can be combinedwith the headset unit. In particular, if larger multimedia units areexcluded in favor of more compact multimedia units, for example wherethe CD player is excluded in favor of an MP3 player, the VCMCD can beembodied as a single headset device capable of performing the variousfunctions disclosed herein.

The present invention can be realized in a combination of hardware andsoftware. Any kind of computer system or other apparatus adapted forcarrying out the methods described herein is suited. Aspects of thepresent invention also can be embedded in a computer program product,which comprises all the features enabling the implementation of themethods described herein, and which when loaded in a computer system isable to carry out these methods. Computer program in the present contextmeans any expression, in any language, code or notation, of a set ofinstructions intended to cause a system having an information processingcapability to perform a particular function either directly or aftereither or both of the following: a) conversion to another language, codeor notation; b) reproduction in a different material form.

This invention can be embodied in other forms without departing from thespirit or essential attributes thereof. Accordingly, reference should bemade to the following claims, rather than to the foregoingspecification, as indicating the scope of the invention.

APPENDIX

Exemplary voice commands for controlling various operational parametersof the voice controlled multimedia and communication device (VCMCD) asdisclosed herein are illustrated below. Notably, the various voicecommands listed herein can be amended to include a code, that if said incombination with the listed voice command, can cause the VCMCD toprocess the received speech using a speaker independent recognitionmodel rather than a user specific speech recognition model, and viceversa.

1. VCMCD (General).

-   -   a. DEVICE (ON/OFF). Turns entire VCMCD on or off.    -   b. DEVICE ON (code). Turns entire device on in a universal voice        mode if user programmed code is specified in combination with        DEVICE ON command.    -   c. VOLUME (UP/DOWN). Raises or lowers volume of presently        enabled multimedia unit.    -   d. STANDBY. The designated feature or the feature currently        operating can be placed on standby.        2. AM/FM Radio Tuner.    -   a. RADIO (ON/OFF). Enables or disables AM/FM radio tuner.    -   b. GO TO (AM/FM). Causes AM/FM radio tuner to switch to        specified band.    -   c. GO TO X PLAY. Causes AM/FM radio tuner to change specified        frequency on current band and play audio from the AM/FM radio        tuner.    -   d. SCROLL (UP/DOWN). Causes AM/FM radio tuner to search for a        next higher or lower frequency which is clearly received.    -   e. SCAN (UP/DOWN). Causes AM/FM radio tuner to scan frequencies        up or down until stopped by user.    -   f. STOP. Causes AM/FM radio tuner to stop scanning frequencies        and play the station to which the AM/FM radio tuner is presently        tuned.        3. Compact Disc (CD) Player.    -   a. CD (ON/OFF). Enables or disables CD player.    -   b. PLAY. Play disc in CD player beginning at first track.    -   c. PLAY TRAC X. Plays specified track of CD in CD player.    -   d. REPEAT DISC. Plays CD continuously until stopped by user.    -   e. REPEAT TRACK. Plays current track until stopped by user.    -   f. STOP. Stops playing current track.        4. Short Distance Radio.    -   a. COMMUNICATOR (ON/OFF). Enables or disables short distance        radio.    -   b. GO TO CHANNEL X. Switches short distance radio to designated        channel.    -   c. SCROLL (UP/DOWN). Causes short distance radio to scroll up or        down a single channel.    -   d. SCAN (UP/DOWN). Causes short distance radio to scan channels        up or down until stopped by user.    -   e. STOP. If received during SCAN operation, causes short        distance radio to stop scanning.        5. Memo Pad.    -   a. MEMO (ON/OFF). Enables or disables the memo pad function.    -   b. (START/STOP). Begins or terminates recording when in memo pad        function.    -   c. SEND TO X. Causes voice memo to be speech recognized and text        to be sent to designated contact or causes audio recording to be        played after connecting with another party over a wireless        communication link.        6. Wireless Telephone.    -   a. PHONE (ON/OFF). Enable or disable wireless telephone.    -   b. SAVE CONTACT. Saves current number, i.e. of received call, in        address list along with recorded name of contact or text name as        determined by speech recognition engine through character        recognition.        7. GPS Receiver/Finder.    -   a. LOCATOR (ON/OFF). Enable or disable GPS receiver.    -   b. DIRECTIONS TO X. Causes GPS receiver to note current position        and initiate audio directions to user specified location.    -   c. SOS. Sends wireless communication, i.e. wireless telephone        call to specified destination, short distance radio        communication, or transponder signal. Can specify GPS determined        coordinates.

1. A portable multimedia and communications device comprising: atransducive element for receiving sound; and a base unit comprising aplurality of multimedia units and a processor executing a speechrecognition engine for recognizing user speech, wherein each of saidplurality of multimedia units can be selectively enabled and operatedresponsive to user voice commands received via said transducive elementand communicated to said base unit via a communication link, wherein thebase unit is a portable electronic device, wherein said plurality ofmultimedia units comprise at least one multimedia unit selected from thegroup consisting of an AM/FM radio, a cassette player, a compact diskplayer, and a digital software-based media player, and wherein saidspeech recognition engine is switched between a speaker-independentoperational mode and a speaker-dependent operational mode responsive toa user spoken utterance.
 2. The device of claim 1, wherein saidtransducive element is disposed in a first headset, said first headsetunit further comprising a sound generation source and a short rangewireless transceiver, said base unit comprising a second short rangewireless transceiver for communicating with said short range wirelesstransceiver of said headset unit, and wherein said communication link isa wireless communication link.
 3. The device of claim 1, wherein saiddevice is voice activated, and wherein said plurality of multimediaunits comprise a voice memo pad.
 4. The device of claim 1, whereinresponsive to activating one of said plurality of multimedia units, saidprocessor disables another active one of said plurality of multimediaunits.
 5. The device of claim 4, wherein said processor disables saidanother active one of said plurality of multimedia units by lowering anaudio volume of said another active multimedia unit.
 6. The device ofclaim 4, wherein said processor disables said another active one of saidplurality of multimedia units by pausing operation of said anotheractive multimedia unit.
 7. The device of claim 4, wherein responsive toterminating use of said activated multimedia unit, said processorre-enables said another active one of said plurality of multimediaunits.
 8. The device of claim 1, wherein one of said multimedia units isa wireless telephone.
 9. The device of claim 8, wherein responsive tosaid wireless telephone receiving a telephone call, said processordisables an active one of said plurality of multimedia units.
 10. Thedevice of claim 9, wherein said processor disables an active one of saidplurality of multimedia units by lowering an audio volume of said activemultimedia unit.
 11. The device of claim 9, wherein said processordisables an active one of said plurality of multimedia units by pausingoperation of said active multimedia unit.
 12. The device of claim 9,wherein responsive to terminating said telephone call, said processorre-enables said active one of said plurality of multimedia units. 13.The device of claim 1, wherein a second one of said multimedia units isa short distance radio, and wherein plurality of multimedia unitscomprise an AM/FM radio interface.
 14. The device of claim 13, whereinresponsive to said short distance radio receiving a wirelesscommunication, said processor disables an active one of said pluralityof multimedia units.
 15. The device of claim 14, wherein said processordisables an active one of said plurality of multimedia units by loweringan audio volume of said active multimedia unit.
 16. The device of claim14, wherein said processor disables an active one of said plurality ofmultimedia units by pausing operation of said active multimedia unit.17. The device of claim 14, wherein responsive to terminating saidwireless communication, said processor re-enables said active one ofsaid plurality of multimedia units.
 18. The device of claim 8, wherein asecond one of said multimedia units is a global positioning (GPS)receiver.
 19. The device of claim 8, wherein a second one of saidmultimedia units is an AM/FM radio tuner, and wherein a third one ofsaid multimedia units is a digital software based media player.
 20. Thedevice of claim 8, wherein a second one of said multimedia units is anaudio player.
 21. The device of claim 20, wherein said audio player isselected from the group consisting of a compact disc player, a digitalsoftware-based media player, and a cassette player.
 22. The device ofclaim 8, wherein a second one of said multimedia units is a voicerecorder memo pad.
 23. The device of claim 2, further comprising: atleast a second headset unit configured similarly to said first headsetunit, wherein said first headset unit and said second headset unitcommunicate with one another through said base unit via wirelesscommunication links.
 24. The device of claim 1, said base unit furthercomprising: an AM/FM radio interface for providing audio from said baseunit to a radio.
 25. The device of claim 1, said base unit furthercomprising: a transponder for locating said device.
 26. A method ofoperating a voice activated, portable multimedia and communicationsdevice comprising: receiving a user spoken utterance in a transduciveelement; transmitting said user spoken utterance to a base unit via acommunication link; speech recognizing said user spoken utterance as avalid voice command; selectively enabling one of a plurality ofmultimedia units disposed in said base unit responsive to identifyingsaid user spoken utterance as said valid voice command, wherein the baseunit is a portable electronic device, and wherein said plurality ofmultimedia units comprise at least one multimedia units selected fromthe group consisting of an AM/FM radio, a cassette player, a compactdisk player, and a digital software-based media player; and switching aselectively-enabled multimedia unit between a speaker-independentoperational mode and a speaker-dependent operational mode in response toa user spoken utterance identified as a valid voice command.
 27. Themethod of claim 26, wherein said transducive element is disposed in aheadset unit.
 28. The method of claim 26, wherein said user spokenutterance is transmitted to a base unit via a short distance wirelesscommunications link.
 29. The method of claim 26, further comprising:activating one of said plurality of multimedia units; responsive to saidactivating steps disabling another active one of said plurality ofmultimedia units.
 30. The method of claim 29, said disabling stepcomprising: lowering an audio volume of said another active multimediaunit.
 31. The method of claim 29, said disabling step comprising:pausing operation of said another active multimedia unit.
 32. The methodof claim 29, further comprising: responsive to terminating use of saidactivated one of said plurality of multimedia units, re-enabling saidanother active one of said plurality of multimedia units.
 33. The methodof claim 26, wherein said voice command activates an intercomoperational mode, said method further comprising: receiving a subsequentuser spoken utterance in said base unit from said headset unit via awireless communication link; and forwarding said user spoken utteranceto a second headset unit via a second wireless communication link. 34.The method of claim 26, wherein one of said plurality of multimediaunits is a wireless telephone, said method further comprising: receivinga telephone call in said wireless telephone; and playing an audionotification through said headset unit indicating that said call hasbeen received.
 35. The method of claim 34, wherein said audionotification plays a name associated with said received telephone call.36. The method of claim 34, wherein said audio notification plays acalling telephone number of said received call.
 37. The method of claim26, wherein another one of said plurality of multimedia units is activewhen said telephone call is received, said method further comprising:responsive to receiving said telephone call, disabling said active oneof said plurality of multimedia units.
 38. The method of claim 37, saiddisabling step comprising: lowering an audio volume of said activemultimedia unit.
 39. The method of claim 37, said disabling stepcomprising: pausing operation of said active multimedia unit.
 40. Themethod of claim 37, further comprising: responsive to terminating saidtelephone call, re-enabling said active one of said plurality ofmultimedia units.
 41. The method of claim 34, wherein said audionotification queries a user whether to answer said received telephonecall.
 42. The method of claim 34, further comprising: determining that acalling number for said received telephone call is not included in acontact list; and querying a user whether to save said received callinformation.
 43. The method of claim 26, wherein one of said pluralityof multimedia units is a short distance radio, said method furthercomprising: receiving a short distance radio communication in said shortdistance radio; and providing an audio notification to said headset unitindicating that said short distance radio communication has beenreceived.
 44. The method of claim 43, wherein another one of saidplurality of multimedia units is active when said short distance radiocommunication is received, said method further comprising: responsive toreceiving said short distance radio communication, disabling said activeone of said plurality of multimedia units.
 45. The method of claim 44,said disabling step comprising: lowering an audio volume of said activemultimedia unit.
 46. The method of claim 44, said disabling stepcomprising: pausing operation of said active multimedia unit.
 47. Themethod of claim 44, further comprising: responsive to terminating saidshort distance radio communication, re-enabling said active one of saidplurality of multimedia units.
 48. The method of claim 43, wherein saidaudio notification queries a user whether to answer said received shortdistance radio communication.
 49. The method of claim 26, wherein one ofsaid plurality of multimedia units is a wireless telephone and anotherone of said plurality of multimedia units is a short distance radio,said method further comprising: exchanging audio between said wirelesstelephone and said short distance radio.
 50. A portable multimedia andcommunications device comprising: a transducive element for receivingsound; and a base unit comprising a plurality of multimedia units and aprocessor executing a speech recognition engine for recognizing userspeech, wherein each of said plurality of multimedia units can beselectively enabled and operated responsive to user voice commandsreceived via said transducive element and communicated to said base unitvia a communication link, wherein the base unit is a portable electronicdevice, wherein said plurality of multimedia units comprise at least onemultimedia unit selected from the group consisting of an AM/FM radio, acassette player, a compact disk player, and a digital software-basedmedia player, and wherein each multimedia unit operates in aspeaker-dependent operational mode until being switched to aspeaker-independent operational mode in response to a user spokenutterance.
 51. A method of operating a voice activated, portablemultimedia and communications device comprising: receiving a user spokenutterance in a transducive element; transmitting said user spokenutterance to a base unit via a communication link; speech recognizingsaid user spoken utterance as a valid voice command; selectivelyenabling one of a plurality of multimedia units disposed in said baseunit responsive to identifying said user spoken utterance as said validvoice command, wherein the base unit is a portable electronic device,and wherein said plurality of multimedia units comprise at least onemultimedia units selected from the group consisting of an AM/FM radio, acassette player, a compact disk player, and a digital software-basedmedia player; and operating a selectively-enabled multimedia unit in aspeaker-dependent operational mode until causing the selectively-enablemultimedia unit to operate in a speaker-independent mode in response toa user spoken utterance identified as a valid voice command.